Induction of Quadratic Decision Trees using Genetic Algorithms and k-D Trees
نویسندگان
چکیده
Genetic Algorithm-based Quadratic Decision Tree (GA-based QDT) has been applied successfully in various classification problems with non-linear class boundaries. However, the execution time of GA-based QDT is quite long. In this paper, a new version of GA-based QDT, called Genetic Algorithm-based Quadratic Decision Tree with k-D Tree (GA-based QDT with k-D Tree), is proposed. In the proposed algorithm, a k-D tree is constructed at each non-leaf node of a decision tree to evaluate the impurity reduction due to a quadratic hypersurface. Experiments show that the proposed algorithm is faster than GA-based QDT on datasets with lower dimensionality or the number of training samples is sufficiently large, without sacrificing the quality of a quadratic decision tree. Although the Gini index is used to measure the impurity of a set of samples, various impurity measures including entropy and the Twoing value can also be used in the proposed algorithm. Key-Words: Decision Trees, Genetic Algorithms, k-D Trees, Quadratic Hypersurface, Impurity, Data Mining
منابع مشابه
An Efficient Predictive Model for Probability of Genetic Diseases Transmission Using a Combined Model
In this article, a new combined approach of a decision tree and clustering is presented to predict the transmission of genetic diseases. In this article, the performance of these algorithms is compared for more accurate prediction of disease transmission under the same condition and based on a series of measures like the positive predictive value, negative predictive value, accuracy, sensitivit...
متن کاملPredicting The Type of Malaria Using Classification and Regression Decision Trees
Predicting The Type of Malaria Using Classification and Regression Decision Trees Maryam Ashoori1 *, Fatemeh Hamzavi2 1School of Technical and Engineering, Higher Educational Complex of Saravan, Saravan, Iran 2School of Agriculture, Higher Educational Complex of Saravan, Saravan, Iran Abstract Background: Malaria is an infectious disease infecting 200 - 300 million people annually. Environme...
متن کاملA New Algorithm for Optimization of Fuzzy Decision Tree in Data Mining
Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification t...
متن کاملEfficient Non-greedy Optimization of Decision Trees
Decision trees and randomized forests are widely used in computer vision and machine learning. Standard algorithms for decision tree induction optimize the split functions one node at a time according to some splitting criteria. This greedy procedure often leads to suboptimal trees. In this paper, we present an algorithm for optimizing the split functions at all levels of the tree jointly with ...
متن کاملدستهبندی دادههای دوردهای با ابرمستطیل موازی محورهای مختصات
One of the machine learning tasks is supervised learning. In supervised learning we infer a function from labeled training data. The goal of supervised learning algorithms is learning a good hypothesis that minimizes the sum of the errors. A wide range of supervised algorithms is available such as decision tress, SVM, and KNN methods. In this paper we focus on decision tree algorithms. When we ...
متن کامل